Detection of polyadenylation signals in human DNA sequences.
Abstract
We present polyadq, a program for detection of human polyadenylation signals. To avoid training on possibly flawed data, the development of polyadq began with a de novo characterization of human mRNA 3' processing signals. This information was used in training two quadratic discriminant functions that polyadq uses to evaluate potential polyA signals. In our tests, polyadq predicts polyA signals with a correlation coefficient of 0.413 on whole genes and 0.512 in the last two exons of genes, substantially outperforming other published programs on the same data set. polyadq is also the only program that is able to consistently detect the ATTAAA variant of the polyA signal.
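The evaluation described above can be illustrated with a small sketch. It is not polyadq itself: it only lists candidate hexamers (the canonical AATAAA and the ATTAAA variant named in the abstract) and computes a correlation coefficient from confusion counts. The abstract does not define its correlation coefficient, so the Matthews-style formula used here is an assumption, and the function names `candidate_signals` and `correlation_coefficient` are hypothetical.

```python
import math

# Canonical hexamer plus the ATTAAA variant named in the abstract.
SIGNAL_HEXAMERS = ("AATAAA", "ATTAAA")


def candidate_signals(seq):
    """Return (position, hexamer) for each candidate hexamer in seq (0-based).

    This is candidate generation only; polyadq scores candidates with
    quadratic discriminant functions, which are not reproduced here.
    """
    seq = seq.upper()
    hits = []
    for i in range(len(seq) - 5):
        hexamer = seq[i:i + 6]
        if hexamer in SIGNAL_HEXAMERS:
            hits.append((i, hexamer))
    return hits


def correlation_coefficient(tp, fp, tn, fn):
    """Matthews-style correlation coefficient (assumed definition).

    tp, fp, tn, fn are counts of true/false positives and negatives.
    Returns 0.0 when any marginal total is zero and the value is undefined.
    """
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0
    return (tp * tn - fp * fn) / denom
```

A predictor would be scored by comparing its accepted candidates against annotated signals, counting the four outcomes, and passing them to `correlation_coefficient`.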
Related articles
Development of an Alu-PCR Amplified YAC Probe Suitable for Enumeration of Chromosome 13 on Uncultured Lymphocytes and Amniocytes by Fluorescence in situ Hybridization
The main objective of the present study was to develop an efficient and reliable probe to be routinely used for detection of chromosome 13 copy numbers by interphase FISH. To achieve this, a Yeast Artificial Chromosome (YAC) containing sequences specific for human 13q12 (744D11), was cultured and the whole yeast genomic DNA was extracted. The human insert within the isolated DNA was amplified b...
Density Clustering Based SVM and Its Application to Polyadenylation Signals
Support vector machines (SVM) have been promising methods for classification analysis due to their solid mathematical foundations. Clustering-based SVMs are used to solve large samples classification problems and reduce the computational cost. In this paper, we present a density clustering based SVM (DCB-SVM) method to predict polyadenylation signal (PAS) in human DNA and mRNA sequences. We decr...
An in-silico method for prediction of polyadenylation signals in human sequences.
This paper presents a machine learning method to predict polyadenylation signals (PASes) in human DNA and mRNA sequences by analysing features around them. This method consists of three sequential steps of feature manipulation: generation, selection and integration of features. In the first step, new features are generated using k-gram nucleotide acid or amino acid patterns. In the second step,...
P-215: Discovery of A Novel APA Variant of A Human Potential Gene Based on Expressed Sequenced Tags Analysis
Background: Expressed sequence tags (ESTs) are sequences of cDNA fragments prepared from different tissue sources. There are over one million of these sequences in the publicly available database, and these sequences are believed to represent more than half of all human genes. The ESTs belong to different cDNA libraries, each prepared from one particular cell type, organ, or tumor. Therefore, th...
DNAFSMiner: a web-based software toolbox to recognize two types of functional sites in DNA sequences
DNAFSMiner (DNA Functional Sites Miner) is a web-based software toolbox to recognize functional sites in nucleic acid sequences. Currently, this toolbox provides two programs: TIS Miner and Poly(A) Signal Miner. The TIS Miner can be used to predict translation initiation sites in vertebrate DNA/mRNA/cDNA sequences, and the Poly(A) Signal Miner can be used to predict polyadenylat...
Journal: Gene
Volume 231, Issue 1-2
Publication date: 1999